Applying multiple regression models for predicting word duration in a corpus of spontaneous speech

نویسنده

  • Na'im R. Tyson
چکیده

Using word duration as a representative of pronunciation variation, the objective of this research was to delineate a set of variables known to affect word duration and determine the total amount of variation in duration accounted for by them in a multiple linear regression model. More importantly, computing the amount of variation each variable contributes (independently of the others) is crucial in proving its predictive power. Authors such as [1] claim that probabilistic measures such as unigram probability greatly affect whether a word is likely to be reduced in its pronunciation (i.e. the more likely a word is to appear, the greater the chance of it being reduced). However, after performing a regression analysis on word durations from the Variation in Conversation (ViC) corpus of spontaneous speech, and computing partial correlation coefficients of each factor, the results showed that probabilistic measures such as unigram and bigram probability account for less than 1% of the variation in word duration. This finding suggests that the predictive power of certain variables is dependent on the type of corpus being examined — in the case of the spontaneous speech studies in [1], the examined corpus consisted of phone conversations, while the ViC corpus contains monologues.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

How are words reduced in spontaneous speech?

Words are reduced in spontaneous speech. If reductions are constrained by functional (i.e., perception and production) constraints, they should not be arbitrary. This hypothesis was tested by examing the pronunciations of highto mid-frequency words in a Dutch and a German spontaneous speech corpus. In logistic-regression models the "reduction likelihood" of a phoneme was predicted by fixed-effe...

متن کامل

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Homophone duration in spontaneous speech: A mixed-effects model

A recent analysis of a corpus of spontaneous speech (Gahl, 2008) showed that homophone pairs differed in duration as a function of word frequency. For example, the high-frequency word time was shorter on average than its less-frequent homophone twin thyme. This effect persisted when other factors affecting word duration were statistically controlled for in a linear regression model. However, th...

متن کامل

Acoustic correlates of word stress in German spontaneous speech

The acoustic properties of word stress have been explored in a number of studies. However, there is little research on German word stress, and even less on its realization in spontaneous speech. This paper tests whether parameters that have been found to implement word stress in mostly laboratory speech are also employed in a corpus of German spontaneous speech. Specifically, we consider spectr...

متن کامل

Understanding VOT Variation in Spontaneous Speech

This paper reports a corpus study on the variation of VOT in voiceless stops in spontaneous speech. Two speakers’ data from the Buckeye corpus are used: one is an older female speaker with a low speaking rate while the other is a younger male speaker with an extremely high speaking rate. Linear regression analysis shows that place of articulation, word frequency, phonetic context, speech rate a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005